An Ontology-Driven Framework for Data Transformation in Scientific Workflows

نویسندگان

  • Shawn Bowers
  • Bertram Ludäscher
چکیده

Ecologists spend considerable effort integrating heterogeneous data for statistical analyses and simulations, for example, to run and test predictive models. Our research is focused on reducing this effort by providing data integration and transformation tools, allowing researchers to focus on “real science,” that is, discovering new knowledge through analysis and modeling. This paper defines a generic framework for transforming heterogeneous data within scientific workflows. Our approach relies on a formalized ontology, which serves as a simple, unstructured global schema. In the framework, inputs and outputs of services within scientific workflows can have structural types and separate semantic types (expressions of the target ontology). In addition, a registration mapping can be defined to relate input and output structural types to their corresponding semantic types. Using registration mappings, appropriate data transformations can then be generated for each desired service composition. Here, we describe our proposed framework and an initial implementation for services that consume and produce XML data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a model-driven transformation framework for scientific workflows

Scientific workflows evolved to a useful means in computational science in order to model and execute complex data processing tasks on distributed infrastructures such as Grids. Many workflow languages and corresponding workflow engines and tools were developed to model and execute scientific workflows, without using established workflow technologies from the business domain. With the adoption ...

متن کامل

Collaborative Data-centric Workflows: Towards Knowledge centric workflows and Integrating Uncertain Data

The acquisition of data, in particular for scientific data, is more and more organized in complex processes that are captured by workflows. These workflows are often driven by ontologies. For example the collaborative application Spipoll [3] proposes to collect information about pollination in France. The users take pictures of insects on flowers, download them on the application and then ident...

متن کامل

An ontology-based framework for bioinformatics workflows

The proliferation of bioinformatics activities brings new challenges - how to understand and organise these resources, how to exchange and reuse successful experimental procedures, and to provide interoperability among data and tools. This paper describes an effort toward these directions. It is based on combining research on ontology management, AI and scientific workflows to design, reuse and...

متن کامل

Dynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture

Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...

متن کامل

Semantic validation and correction of scientific workflows

Scientific workflows describe steps for orchestrating the execution of a network of computational operators toward some goal, such as data transformation for analysis or visualization. Typically, these operators consume and emit transformed data, or cause some effect. In most scientific workflow systems, the operators are typed to enable compatibility checks for their composition that make up a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004